pursuit game meaning in English
追逐对策
Examples
- After introducing some basic concepts of agent 、 mas and multi - agents learning , the thesis analyses the research actuality and the future developmental directions of rl and multi - agent rl ( marl ) . furthermore , the theory and related learning algorithms of them are briefly introduced . on the basis of analyses of pursuit game , aimed at the individual action learner , the thesis extends the rl algorithm for single agent , proposes the macrl - cc algorithm . finally , aimed at the joint action learner , a team - stochastic - games - based ( tsgs - based ) framework for multi - agents cooperative rl is defined
文章首先介绍了agent和多agent系统、以及多agent学习的一些基本概念,然后介绍了强化学习和多agent强化学习的研究现状和未来发展方向。第二部分对强化学习理论和多agent强化学习理论进行了简要介绍。在对pursuitgame问题进行初步分析的基础上,针对独立行为学习者,扩展了单agent强化学习算法,提出了基于承诺和约定的多agent协同强化学习方法macrl - cc 。 - In artificial intelligence field , pursuit game is often used to test learning algorithms , and for this problem , the thesis establishes two multi - agent cooperative reinforcement learning methods ( macrl ) for multi - agents : commitment - conventions - based learning method ( macrl - cc ) and joint - action - priority - sequence - based learning method ( macrl - japs )
Pursuitgame问题常用于来测试人工智能领域的学习算法,本文就此问题提出了两种多agent协同强化学习方法:基于承诺和约定的方法和基于联合行为优先序列的方法。 - In order to solve the multi - equilibria problem in the stochastic games , a macrl algorithm called macrl - japs is proposed . these two learning methods have been justified by experiments . the main research achievements and innovations are the establishment of two macrl methods for pursuit game , which are justified by experiments
针对联合行为学习者,给出了多agent协同强化学习的团队随机博弈框架,并解决了多最优均衡解问题,提出了基于联合行为优先序列的多agent协同强化学习方法macrl - japs 。